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1.  Introduction 

■  ^  Central  Planning  of  production  is  usually  performed  under  asym¬ 

metric  information.  The  firm  in  general  has  more  information  about  its 
productive  possibilities  than  the  Central  Planner  (from  now  on  for  short 
CP).  This  Justifies  the  use  of  incentives  schemes  in  which  the  CP  does 
not  directly  fix  activity  levels.  Casual  observation  of  such  existing 
schemes  suggests  that  the  time  dimension  is  crucial  for  their  working. 
Typically  the  CP  revises  the  Incentive  scheme  over  time  to  take  into 
account  the  information  provided  by  the  firm's  performance.  Managers  of 
centrally  planned  economies  as  well  as  economists  have  long  recognized 
that  this  revision  induces  firms  to  underproduce  in  order  to  avoid 

demanding  schemes  in  the  future.  This  is  the  ratchet  effect. _ .p 

Although  the  study  of  incentives  schemes  in  centrally  plannned 
economies  has  become  popular  in  the  late  seventies,  few  researches  have 
focused  on  the  ratchet  effect.  Various  authors  have  investigated  how 
schemes  may  induce  the  firm  to  reveal  its  information  before  producing 
(Fan  ll9V5l,  Bonin  [1976I,  Gindin  [l970l)  or  the  effect  of  a  given 
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incentive  scheme  on  the  firm's  behavior  (see  Weitzman  [19761  on  the  new 
Soviet  Incentive  Scheme).  Also  Weitzman  I1980]  gave  examples  of  the 
ratchet  effect  assuming  a  given  (not  necessarily  optimal)  intertemporal 
scheme.  Holmstrom  | 19791  showed  that  the  revision  procedures  in  the  new 
Soviet  Incentive  Scheme  in  general  dominate  fixed  target  schemes.  The 
two  latter  contributions  rely  on  the  assumption  that  the  CP  commits 
himself  in  advance  to  a  given  scheme.  Although  the  exploration  of  the 
phenomenon  under  this  latter  assumption  is  useful,  it  must  be  stressed 
that  most  often,  in  practice,  the  CP  does  not  commit  himself  to  a  revi¬ 
sion  procedure;  this  latter  fact  seems  to  be  crucial  for  understanding 
qualitative  features  of  observed  ratchet  effects . 

_  _  It  is  the  purpose  of  the  present  paper  to  study  the  ratchet  effect 

under  the  non-commitment  assumption,  which  we  believe  to  be  more  realis¬ 
tic.  The  inadequacy  of  conceptual  tools  made  it  difficult  in  the  past 
to  consider  such  an  assumption.  Now  basic  research  on  dynamic  games 
with  incomplete  information  recently  resulted  in  new  ideas  which  look 
particularly  relevant  for  a  theoretical  analysis  of  the  problem.  — 

V 

The  dynamic  theory  of  the  relationship  CP-firm  that  we  attempt 
here,  relies  on  these  modern  game-theoretic  developments.  Although  our 
model  is  highly  stylized,  it  is  designed  to  incorporate  the  central 
positive  and  normative  aspects  of  the  problem,  as  they  have  been  more  or 
less  explicitly  described  in  previous  theoretical  literature. 

The  content  of  the  paper  can  be  summarized  as  follows:  Section  2 
introduces  the  model.  The  CP  is  assumed  to  have  incomplete  information 
about  the  productivity  of  the  firm  (in  most  of  our  paper,  we  shall 
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suppose  that  this  productivity  can  take  only  two  values).  We  explain 
why  the  CP  does  not  decentralize  the  "first  best”  optimum  by  having  the 
firm  face  the  social  value  of  its  output.  Hence,  in  a  second  best 
framework  we  consider  two  classes  of  schemes:  the  linear  and  the 
totally  non-linear  ones.  As  a  starting  point,  we  study  the  influence  of 
incomplete  information  on  the  optimal  incentive  scheme  in  a  static 
framework  (Section  3).  We  show  that  the  marginal  price  of  output  in  the 
reward  scheme  is  lower  than  its  shadow  price  and  we  explain  why. 

Section  U  introduces  dynamics  and  the  possibility  for  the  CP  to  revise 
the  incentive  scheme  over  time.  To  simplify  we  consider  a  two-period 
model  where  the  essence  of  the  phenomenon  can  be  captured  while  letting 
the  argument  be  reasonably  simple.  To  be  consistent  with  casual  obser¬ 
vation,  we  assume  that  the  CP  does  not  commit  himself  to  a  intertemporal 
incentive  scheme  at  the  start.  Reasons  why  commitment  may  not  be 
possible  are  discussed  in  Section  7*  The  sequential  choice  of  reward 
schemes  by  the  CP  and  outputs  by  the  firm  is  most  naturally  modelled  as 
a  game  between  the  two  partners.  We  show  that  this  game  has  a  unique 
welfare  outcome.  In  the  optimal  incentive  scheme,  the  firm  may  hide  its 
information  or  reveal  it.  Even  in  the  latter  case  the  ratchet  effect 
exists  in  the  sense  that  the  CP  may  choose  a  scheme  which  is  suboptimal 
from  a  static  point  of  view  in  order  to  induce  revelation.  We  also  show 
that  we  should  expect  the  marginal  price  of  output  in  the  reward  scheme 
to  exceed  its  optimal  static  marginal  price.  These  results  are  derived 
in  Section  5. 

Most  of  our  analysis  is  concerned  with  (optimal)  linear  and  non- 
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linear  schemes.  An  important  part  of  the  recent  literature  deals  with 


"intermediate"  schemes,  the  so-called  old  and  new  Soviet  incentive 
schemes.  Section  6  discusses  these  schemes  in  the  light  of  our  model. 

It  is  likely  that  the  relevance  of  this  work  goes  far  beyond 
Central  Planning  since  any  type  of  long  run  relationship  between  two 
agents  under  asymmetric  information,  delegation  within  a  firm,  taxation, 
etc.,  is  not  one-shot.  As  the  principal  then  learns  necessarily  over 
time,  some  form  of  ratchet  effect  necessarily  obtains.  Section  8 
mentions  these  applications  and  stresses  analogies.  But  much  work 
remains  to  be  done  to  formalize  them. 

2.  The  Model 

2.1  The  Set  Up 

a)  Technology :  A  firm  produces  one  good  whose  "social  value"  per 
unit  p  is  exogenously  given.  The  production  parameter  y  is  one¬ 
dimensional.  It  can  be  interpreted  either  as  the  production  level  or  as 
excess  production  above  a  minimum  level  which  can  be  enforced  through 
existing  direct  controls.  The  cost  to  the  firrai/  of  producing  y 
(excess)  units  of  the  good  is  a  function  <|'(y,6)  of  y  and  of  a  produc¬ 
tivity  parsuneter  9;  9  stands  for  the  ability  of  workers  and  managers, 
true  capacity  of  the  firm,  etc.,  and  can  onl^'  be  observed  by  the  firm. 

The  CP  has  some  probability  distribution  f(9)  about  it,  which  is 
common  knowledge.  In  most  of  the  paper,  the  probability  distribution 

will  be  confined  to  two  values  9  and  9(9  <  B)  with  respective 

2/” 

probabilities  v  and  (l  -  v)  .  The  cost  function  will  then  be 
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written  \|i(y)  or  t(y) . 

The  production  cost  can  be  thought  of  as  embodying  the  managers 

and  workers'  effort  and  cannot  be  observed  by  the  CP.  We  shall  make  the 

following  assxamptions  on  ip:  the  cost  increases  with  and  is  convex  in 

output:  ip  <  0,  Ip  >0  and  decreases  with  productivity:  ip„  <  0. 
y  yy  •' 

Furthermore  the  marginal  cost  of  production  decreases  with  productivity: 

Ve  <  0- 

b)  Incentive  Scheme:  The  CP  rewards  the  firm  with  an  incentive 
scheme  R(y),  and  derives  a  utility  py  from  output  y. 

Standard  first  best  analysis  would  suggest  to  reward  the  firm 
according  to  R(y)  =  py;  this  conclusion  will  be  incorrect  here  where  it 
is  assumed  that  there  is  a  social  cost  of  $X  to  transferring  $1  to  the 
firm.  There  are  two  possible  polar  interpretatios  of  this  shadow  price 
X. 

First,  $1  given  to  the  firm  is  worth,  in  terms  of  social  welfare 
$(l  -  X);  the  multiplier  X  then  accounts  for  distortions  associated 
with  the  firm's  distribution  of  its  revenue.  This  interpretation  looks 
particularly  pertinent  to  the  context  of  Central  Planning;  such  an 
organization,  particularly  in  its  soviet-type  version,  stems  from  a 
distrust  of  market  mechanisms,  both  for  their  distributional  effects  on 
individual  incomes  but  also  for  their  alloc>ational  effects  on  invest¬ 
ments.  The  distribution  of  proceeds  py,  would  have  undesirable 
features  when  coming  in  addition  to  existing  money  flows  in  an  otherwise 
planned  economy;  the  surplus  if  distributed  may  generate  uncontrolled 
inequalities  or  if  it  is  invested,  lead  to  relative  capital  accumulation 
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considered  as  unappropriate.  For  a  more  basic  reflection  on  these 
phenomena,  a  general  equilibrium  analysis  would  be  needed;  the 
multiplier  should  then  appear  as  the  outcome  of  a  second  best  optimiza¬ 
tion  under  basic  informational  and  derived  institutional  constraints 
(absence  of  capital  markets,  etc.). 

A  second  interpretation  relates  X  to  the  social  cost  of  raising 
funds.  This  is  a  standard  assumption,  precise  justifications  of  which 
have  been  given  when,  for  example,  funds  are  raised  through  distor¬ 
tionary  taxation.  Justifications  of  a  similar  kind  could  be  given  in 
the  context  of  Central  Planning. 

In  formulating  the  problem  Just  in  the  following,  we  will  stick, 

for  the  sake  of  convenience,  to  the  second  interpretation,  although  the 

reader  will  convince  himself  that,  the  first  story  or  any  mixture  of  the 

first  and  the  second  stories  can  lead  to  the  same  formulation. 

F 

We  formalize  the  firm's  objective  function  ti  as  the  difference 
between  its  revenue  and  its  cost: 

(2.1)  Tf^(y,9)  =  R(y)  -  ')'(y,e) 

It  is  convenient  to  split  the  CP's  objective  function  into  two 

parts.  First,  the  CP  is  concerned  with  the  "rest  of  the  world"  welfare, 

i.e.,  with  the  difference  between  the  social  value  of  output  and  the 

social  cost  of  transferring  R(y)  to  the  firm.  This  first  part  will  be 

referred  to  as  the  CP's  "social  concern"  utility  function  and  denoted 
CP 

1T  .  Sticking  to  the  interpretation  of  X  as  a  shadow  cost  of  raising 


(2.2) 


(y)  =  py  -  (i  +  x)R(y) 
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The  CP  also  takes  into  account  the  firm's  welfare  it  . 

Thus  aggregate  welfare  W  can  be  written: 

P  PP 

(2.3)  w(y,e)  =  IT  +  It  =  py  -  '('{y,6)  -  XR(y) 

F  CP 

It  is  useful  to  represent  indifference  curves  for  it  and  it 

in  the  revenue-output  space*  This  is  done  in  Figures  1  and  2.  We  shall 

assume  that  the  CP  is  risk  neutral.  We  also  assume  that  the  firm  can 

guarantee  itself  some  minimum  utility,  for  example,  as  suggested  above 

by  producin’.;  at  a  minimum  acceptable  level.  Without  loss  of  generality 

we  shall  assume  that  the  minimum  utility  level  is  associated  with  a  zero 

F 

production  and  is  equal  to  0.  Thus  the  firm  maximizes  n  (y,0) 

F 

subject  to  the  individual  rationality  constraint  n  (y,6)  >  0.  If 
max  ii‘(y,0)  <  9  the  firm’s  output  is  zero. 

y 

Let  s  denote  the  ratio  p/(l  +  X).  Note  that  s  is  the  slope 
of  the  CP's  social  concern  indifference  curves.  For  reasons  that  will 
become  clear  shortly,  s  will  be  called  the  shadow  price  of  output. 

An  important  part  of  this  paper  will  be  concerned  with  the  class 
of  linear  schemes,  i.e.,  schemes  of  the  following  type:  R(y)  =  a  +by. 

A  linear  scheme  can  also  be  written  as:  R(y)  =  b(y  -  y^)  where 

y^  =  -(a/b)  can  be  interpreted  as  a  quota,  and  b  as  a  bonus  per  unit 

of  output  above  the  quota .-2/ 

2.2  Static  Incentive  Scheme  Under  Complete  Information 
In  order  to  motivate  the  following,  we  first  consider  the  simple 
case  of  complete  information  about  0.  Let  us  derive  the  optimal  linear 
scheme  (which  in  this  case  is  also  the  optimal  non-linear  scheme) .  Let 
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F 

y(b,0)  denote  the  output  that  maximizes  (y,6).  For  any  given 
bonus  b,  it  is  desirable  from  the  viewpoint  of  aggregate  welfare  to 
choose  the  lump-sum  transfer  a  so  as  to  make  the  firm's  individual 
rationality  constraint  binding:  lowering  it  by  6a>  0,  other  things 
being  equal,  does  not  affect  the  firm's  production  decision  and 
increases  aggregate  welfare  W  by  6W  =  X6a.  Hence 

a  =  >|'{y(b,0)  ,6)  -  by(b,0).  What  bonus  will  the  CP  choose?  b  must  be 
such  that  the  (marginal)  social  price  of  output  net  of  transfer  costs 
equals  the  marginal  value  of  output  for  the  firm  (b).  The  former  is 
nothing  but  the  difference  between  the  social  value  of  one  unit  of 
output  p  and  the  shadow  cost  of  transferring  $b,  Xb.  Thus 

(2.4)  p-Xb=b 


or 


(2.5) 


s 


The  formal  proof  of  this  latter  point  is  as  simple  as  the  heuris¬ 
tics:  given  that  we  have  shown  that  the  individual  rationality 
constraint  of  the  firm  has  to  be  binding  at  the  optimum,  the  CP's 
problem  reduces  to  the  maximization  of  (sy  -  R)  over  the  indifference 
curve  of  the  firm  R  -  '|'(y,0)  =  0,  i.e.,  to  the  unconstrained 
maximization  of  sy  -  'l'(y,0).  The  optimum  is  uniquely  determined  by 


s  and  is  decentralized  through  a  linear  bonus  function  with  b  =  s. 


It  follows : 


Proposition  1:  The  optimal  bonus  under  complete  information  is 
equal  to  the  shadow  price  of  output  s. 

3.  Static  Incentive  Schemes  Under  Incomplete  Information 

We  first  analyze  the  optimal  linear  scheme  when  the  relationship 
between  the  firms  and  the  CP  is  short-run  and  there  are  two  types  of 
firms.  We  then  test  the  robustness  of  our  main  conclusion  to  the  class 
of  eligible  schemes  (by  allowing  non-linear  ones)  and  to  the 
distribution  of  firms. 

a)  Linear  Scheme  With  Two  Types  of  Firms:  We  assume  that  the  CP 
can  choose  only  in  the  class  of  linear  schemes  and  that  the  firm  has  two 
potential  productivities  0  and  0  with  probability  v  and  (l  -  v). 
We  shall  also  assume  that  the  proportion  of  low  productivity  firms  is 
not  "too  low"  relative  to  the  productivity  differential,  so  that  the  CP 
never  wants  to  induce  such  a  firm  not  to  operate  (otherwise  the  problem 
is  trivial).  Let  y(b)  and  y(b)  be  the  firm's  utility-maximizing 
outputs  of  both  types  of  firms  when  the  bonus  is  b.  Note  that 
y(b)  >  y(b).  The  CP  maximizes: 

(3-1)  W  =  vlpy(b)  -  i('(y(b))  -  A(a  +  by(b))l 

+  (1  -  v)(py(b)  -  ii''v(b))  -  X(a  +  by(b))l 

Clearly,  the  lump-sum  transfer  must  make  the  low-productivity 
firm's  individual  rationality  constraint  binding  (the  argument  of 
Section  1  is  transposed  here) : 


(3.2) 


a  =  ♦(y(b))  -  by(b) 


Differentiating  (3.1)  gives: 

(3.3a)  ^  =  (1  X)l(s  -  b)lv  (l  _  v)  g  1  - 

The  first  order  condition  obtains  when: 


(3.3b) 


y(b)  -  y(b) 


-  (1  -  V)  ^(b) 


Since  output  grows  with  the  productivity  and  the  bonus,  we 
conclude  from  (3.3a,b): 


Proposition  2:  Under  incomplete  information  about  the 
productivity  of  the  firm, 

(i)  W  is  a  decreasing  function  of  b  for  b  >  s. 

(ii)  the  optimal  bonus  is  smaller  than  the  shadow  price  of  output 

We  can  give  a  more  intuitive  proof  of  this  proposition  as  follows: 

Assume  that  for  the  optimal  scheme  b  >  s.  Let  B  =  (R,y),  B  =  (R,y) 
be  the  choices  of  firms  B  and  0.  Now  assume  that  the  CP  offers  the 
linear  scheme  with  slope  s  that  leaves  the  low  productivity  firm  with 
a  zero  surplus.  If  the  firm  is  of  type  0,  it  is  indifferent  to  this 
change  of  scheme;  since  the  CP  reaches  a  better  "social  concern" 
indifference  curve,  ex-post  aggregate  welfare,  in  case  the  firm  has  a 
low  productivity,  increases.  Assume  now  that  the  firm  has  type  B.  The 
firm  reacts  to  the  new  scheme  of  slope  s  by  choosing  B'  =  (R',y'). 
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This  change  can  be  decomposed  into  two  steps . 

-  A  change  along  <  "'e  indifference  curve  through  the  original  point 
(B,R) .  This  corresponds  to  a  "substitution  effect".  For  the  same 
reason,  as  above,  the  B  firm  being  indifferent  to  this  move  and  the 
CP's  social  concern  utility  increasing,  aggregate  welfare  increases. 

-  A  change  from  the  upper  indifference  curve  through  B  to  the 
lower  indifference  curve  through  B' .  This  corresponds  to  an  "income 
effect"  and  given  our  separability  assumption  it  only  involves  an  income 
transfer  from  the  firm  to  the  society  and  according  to  (2.3)  this  move 
is  beneficial  from  the  point  of  view  of  social  welfare. 

Hence  if  the  firm  is  of  type  B,  aggregate  welfare  increases  when 
moving  from  a  bonus  scheme  b  >  s  to  a  bonus  scheme  with  b  =  s.  Hence 
b  >  s  cannot  be  optimal.  The  argument  conveys  the  following  intuitive 
ideas:  The  CP  is  forced  to  leave  a  surplus  to  the  high  productivity 
firm  if  he  wants  the  low  productivity  firm  to  produce.  Raising  the 
bonus  above  the  shadow  price  of  output  creates  a  productive  raisalloca- 
tion  and  increases  the  high  productivity  firm's  surplus  (see  below)  and 
thus  cannot  be  optimal 

b)  Varying  the  Probability  Distribution  of  Firms:  It  will  be 

very  useful  for  the  dynamic  study  to  consider  what  happens  when  the 

-F , 

proportion  of  high  and  low  productivity  firms  changes.  Let  n  (v) 

denote  the  high  productivity  firm's  profit  when  the  proportion  of  low 

productivity  firms  is  v  and  the  CP  chooses  the  optimal  linear  scheme 

(a(v),  b(v))  (so  as  to  keep  both  types  active).  Remember  that  the 

F 

profit  of  the  low  productivity  firm  ir  is  equal  to  0. 
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We  can  show: 

Proposition  3: 

(i)  b(v)  increases  with  v. 

Cl 

(ii)  i  (v)  is  continuous  and  increasing  in  v. 

Proposition  3,  which  is  proved  in  Appendix  1,  says  that  the 
optimal  bonus  and  the  profit  of  high  productivity  firms  increase  with 
the  probability  of  a  low  productivity  firm.  This  latter  fact  is  indeed 
the  basic  reason  why  the  firm  would  like  to  pretend  it  has  a 
productivity  lower  than  its  true  one. 

c)  Optimal  Non-Linear  Scheme:  We  continue  with  the  two- type 
case,  but  now  with  a  non-linear  scheme.  The  kind  of  reasoning  involved 
here  is  reminiscent  of  the  analysis  on  optimal  taxation  originating  in 
Mirrlees  (l9Tll  and  in  particular  is  closely  related  to  the  analysis  of 
Guesnerie-Seade  [ 19821  which  is  concerned  with  a  finite  number  of 
types  ^ 

The  CP  chooses  two  pairs  of  output-reward  (y,R)  for  firm  9  and 
(y,R)  for  firm  R.  These  must  satisfy  two  conditions:  Self-selection 
(no  type  wants  to  choose  the  other  type's  pair)  and  individual  rational¬ 
ity  (both  firms  are  free  not  to  operate) .  As  is  usual,  the  only  binding 
self-selection  constraint  corresponds  to  the  high  productivity  firm  not 
pretending  it  has  a  low  productivity.  Thus: 

(3.U)  R  -  ip(y)  =  R  -  ?i(y) 

Furthermore,  the  low  productivity  firm  must  have  no  surplus. 
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(3.5)  R  =  4'(y) 

Otherwise  the  CP  could  transfer  money  to  himself  by  reducing  R,  and 
this  would  only  relax  the  self-selection  constraint. 

Note  that  we  reduced  the  CP's  problem  to  the  choice  of  two 
outputs,  y  determines  R  from  (3.5)  and  (3.^+)  then  gives  R  given 
y.  Thus  the  CP  maximizes: 

(3-6)  Max  {v[py  -  >l'(y)(l  +  X.)l  +  (l  -  v)[py  _  ipCy)  -  X(\(i(y)  +  \p(y)  -  ip(y)l} 

{y,y}  -  -  - 


The  first  order  conditions  for  this  problem  are: 


(3.7) 


^'(y)  =  s 


(3.8) 


s  -  i|''(y)  =  l'l''(y)  -  ^''(y)l 


(3.7)  says  that  the  high  productivity  firm  locally  faces  the  shadow 
price  of  output  s.  This  is  the  familiar  "no-distortion-at-the-top" 
result.  (3.8)  is  somewhat  similar  to  (3.3a,b).  Since  the  marginal  cost 
of  production  decreases  with  productivity,  (3.8)  implies  that  the 
marginal  cost  for  the  low  productivity  firm  is  lower  than  the  shadow 
price  of  output  s.  In  this  sense.  Proposition  2  is  robust  when  one 
considers  non-linear  schemes. 


Long-Run  Relationshij 
Equilibrium 


and  the  Ratchet  Effect:  Existence  of 


rnamic  Model :  We  now  want  to  allow  for  the  possibility 


that  the  CP  learns  over  time  about  the  firm's  constant  productivity 
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Our  framework  is  reminiscent  of  that  employed  by  Fudenberg  and  Tirole 
[1983)  in  their  study  of  sequential  bargaining  under  incomplete 
information.  There  are  two  types  of  firms,  0  and  0(0  <  0)  with 
respective  probabilities  and  (l  -  There  are  two  periods, 

t  =  1,2.  The  social  price  of  output  p  remains  constant  over  time.  We 
first  assume  that  the  CP  chooses  schemes  in  the  linear  class.  The 
timing  is  the  following:  At  the  start  of  the  first  period  the  CP 
chooses  a  scheme  (a^,b^);  the  firm  reacts  to  this  scheme  by  producing 
y^  at  cost  4'(y^,0)  and  is  rewarded  (a^  +  b^y^)*  Then  at  the  start 
of  the  second  period  the  CP  chooses  a  new  scheme  (a^.b^),  and  the  firm 
chooses  a  second  period  output  ^2-  Le^  ^  the  common  discount 
factor.^/  With  straightforward  notation  the  discounted  profit  and 
aggregate  social  welfare  are; 

(I4.I)  =  TT^  +  =  [a^^  +  b^y^  -  ij'(y^,0))  +  +  b2y2  -  'I'(y2.6)l 

{I4.2)  W  =  =  (tt^  +  «’>2^  +  ("5^  +  6Tr^^) 

=  Ipy^^  -  '('(y^^.S)  -  +  b^y^)]  +  6[py2  -  'J'(y2»®)  “  +  b2y2)l 

The  situation  depicted  above  is  a  dynamic  game  between  the  firm  and  the 

CP.  The  CP's  strategy  is  a  sequence  of  schemes  (a, ,b. )  and  {a-,b„)/ 

11  2  2  laj^,b 

where  the  second  period  scheme  is  a  function  of  his  information  at  that 
date,  i.e.,  that  the  firm  has  chosen  to  produce  y  under  the  incentive 

v 

scheme  insist  on  the  fact  that  the  CP  does  not  commit 

himself  in  advance  to  a  given  revision  process).  The  firm's  strategy  is 
a  sequence  of  outputs  y^^  (a^^  ,b^  ,0 )  and  y2(a^  ,bj^  ,a2  ,b2 ,0) ,  where  the 
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output  at  a  given  period  is  a  function  of  the  firm's  information  at  that 
date  (clearly,  y2  will  depend  only  on  and  6  in  equili¬ 

brium).  Mixed  strategies  are  allowed.  To  introduce  the  equilibrium 
notion,  we  must  also  describe  the  CP's  beliefs  about  the  productivity  of 
the  firm  at  the  beginning  of  the  second  period.  Let  ’^1 ’^1  ^ 

denote  the  CP's  posterior  probability  that  the  firm  has  a  low  produc¬ 
tivity.  The  equilibrium  notion  -  Perfect  Bayesian,  or  Sequential 
Equilibrium  in  the  terminology  of  Kreps-Wilson  [1982]  to  which  we  refer 
for  a  careful  analysis  -  requires  that  strategies  and  beliefs  be 
consistent;  i.e.,  that  the  strategies  be  optimal  given  the  belieVs  and 
that  the  beliefs  be  derived  from  strategies  using  Bayes  rule. 

A  Perfect  Bayesian  Equilibrium  is  a  set  of  (possibly  mixed) 
strategies  {(a^,bj^),  y^,  (a^.b^),  y^)  and  of  beliefs  satisfying 

(we  shall  often  omit  the  arguments  of  strategies  and  beliefs  for 
clarity) : 


tp 

Pi)  V  6,  y^  maximizes 

P2)  maximizes  the  expectation  of  W^,  given  the  CP's 

beliefs  firm's  second  period 

strategy  yg. 


F* 

P3)  V  6,  y^  maximizes  given  the  second  period  strategies. 

P4)  maximizes  the  expectation  of  W,  given  the  firm's 

and  the  CP's  subsequent  strategies. 

B)  ’^1 ’^1  ^  Bayes-consistent  with  the  prior  probability 

and  the  firm's  first  period  strategy  y^(aj^,b^,6) . 

(Pl)  to  (PU)  only  describe  the  principles  of  Kuhn's  algorithm  for 
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finding  a  dynamic  perfect  Nash  equilibrium  when  the  probabilities  of 
reaching  the  knots  of  the  tree  are  exogenously  given.  (B)  endogenizes 
these  probabilities  in  conformity  with  Bayes  rule. 

b)  Resolution: 

Note  that  a  given  {a^.b^^}  defines  a  game  between  the  CP  and  the 
firm  which  is  a  subform  of  the  initial  game.  Perfectness  requires  that 
equilibrium  strategies  in  the  larger  initial  game  induce  an  equilibriiim 
for  this  game,  which  we  call  a  continuation  equilibrium. 

We  will  say  that  a  continuation  equilibrium  is  unique  if  the 
observed  equllibrixim  actions  are  unique. 

In  this  section,  we  focus  on  existence  and  uniqueness  of  continua¬ 
tion  equilibria.  We  shall  denote  by  y(b^)  and  yCb^^)  the  outputs 
that  maximize  the  one-period  surplus  of  the  low  and  high  productivity 
firm  when  the  bonus  is  b^  and  first-period 

outputs  when  the  firms  take  into  account  the  effect  of  their  choice  on 
the  second  period  incentive  schemes;  under  the  assumption  that  the  firms 
operate,  these  outputs  do  not  depend  on  a^,  as  we  shall  see  below. 

We  first  notice  that  from  (Pl)  and  (P2)  the  second  period  actions 
of  the  firm  and  the  CP  are  the  same  as  in  the  one-period  game  with  prior 
v^.  They  have  been  analyzed  in  Section  3.  Let  us  now  consider  the 
first  period  actions.  We  first  assume  that  both  types  of  firms  operate 
(we  come  back  to  this  assximption  later).  For  the  moment,  the  statement 
"for  any  bonus  b^"  should  be  qualified  by  "assuming  that  a^^  is  high 
enough  so  that  both  firms  operate". 
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We  first  derive  tvro  necessary  conditions  for  continuation  equili¬ 
brium. 

Lemma  1:  V  b^^ : 

In  other  vrords,  the  optimal  first  period  action  of  the  low  produc¬ 
tivity  firms,  in  the  dynamic  context  and  in  equilibrium,  can  only  be  its 
optimal  one-period  action. 

Proof  of  Lemma  1:  From  Section  3,  we  know  that;,  whatever  his 
beliefs  v^,  the  CP  will  not  allow  the  low  productivity  firm  to  make  a 
strictly  positive  surplus  in  the  second  period.  Thus  this  type  of  firm 
is  only  concerned  with  its  first  period  surplus  it^,  and  chooses 
^l^^l^  so  as  to  maximize  it.  Q.E.D. 

Lemma  2:  V  b^^;  yj^(b^)e{y(b^)  ,y(b^)} 

Proof  of  Lemma  2:  If  output  yj^  belongs  to  the  support  of  the 
high  productivity  firm  strategy,  from  Bayes  rule,  the  CP's  posterior 
beliefs  must  be  equal  to  zero  unless  y^  =  ^(b^).  Assume 

*  y(h^).  Since  the  CP  knows  that  the  firm  has  a  high  productivity 
with  certainty,  the  latter  does  not  obtain  a  positive  second  period 
surplus.  Since,  whatever  the  CP's  beliefs,  the  high  productivity  firm 
can  guarantee  itself  a  non-negative  profit  in  the  second  period, 
switching  from  any  y^  in  the  support  of  its  strategy  and  different 
from  both  y(b^)  and  y(bj^)  to  y(bj^)  would  give  such  a  firm  a 
strictly  higher  profit .  This  shows  that  the  support  can  only  consist  of 
two  values,  as  asserted  in  Lemma  2.  Q.E.D. 
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From  Lemmas  1  and  2,  it  follows  that  there  can  only  be  three  kinds 
of  equilibria: 

1)  Pooling  Equilibrium;  Both  types  of  firm  choose  y(bj^).  The 
CP's  posterior  belief  is  then:  V2(a^ .b^^ ,y(b^) )  =  v^. 

2)  Separating  Equilibrium:  The  low  productivity  firm  produces 
y(b^)  and  the  high  productivity  firm  y(b^).  Then:  V2(aj^,bj^,y(b^))  =  1 
and  V2(a^,b^,y(b^))  =  0. 

3)  Semi-Separating  Equilibrium:  The  low  productivity  firm 
produces  y(bj^)  and  the  high  productivity  firm  randomizes  between 
y(b^)  and  y(b^).  Then: 

''2^^1’^1’^^^1^^  =  0. 

The  determination  of  the  equilibrium  type  relies  on  the  consider¬ 
ation  of  the  (first  period)  cost  of  concealing  productivity  for  a  high 
productivity  firm  when  first  period  bonus  is  b^^ .  It  is  denoted 
A(b^). 

(U.3)  A(bj^)  H  bj^ly(bj^)  -  y(bj^)l  -  l'Ky(b^))  -  ^(y(b^))l 

Note  that  A(0)  =  0  and  that  A(b^)  >  0  for  b^  >  0. 

Clearly,  this  short-run  cost  of  concealment  has  to  be  compared 

-F, 

with  the  long-run  gain.  The  latter  relates  with  6v  (v)  which,  in  the 
notation  of  Section  3,  will  be  the  discounted  surplus  of  the  second 
period  high  productivity  firm  when  the  CP's  a  posteriori  beliefs  are 

V. 

The  interest  of  the  above  comparison  is  precisely  confirmed  by  the 


next  proposition. 
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Proposition  U:  V  b^,  there  exists  a  unique  continuation 
equilibrium. 

If  bj^  is  such  that: 

TJI 

A(b^)  <  6i  (v^)  the  equilibrium  is  pooling 

6i^(l)  >  A(b^)  >  6i^(v^)  the  equilibrium  is  semi- separating 

A(bj^)  >  Aii  (l)  the  equilibrium  is  separating. 

Proof:  Consider  successively  the  three  possible  types  of 
equilibria: 

Pooling  Equilibrium:  In  a  pooling  equilibrium,  the  long-run  loss 
of  playing  y(b^)  instead  of  y(b^)  is  Air  (v^)  since  the  latter  is 
the  high  productivity  firm's  discounted  second  period  surplus  when  the 
CP  does  not  acquire  information  in  the  first  period.  Thus  a  necessary 
condition  for  the  existence  of  a  pooling  equilibrium  is: 

(4.4)  6i^(v^)  >  A(b^) 

Conversely,  let  us  show  that  if  (4.4)  holds,  there  exists  a 
pooling  equilibrium.  For  that,  we  must  complete  equilibrium  actions  and 
beliefs  with  out-of-equilibrium  ones.  Since  the  second  period  stra¬ 
tegies  of  the  firm  and  the  CP  are  uniquely  defined  in  Section  3  for  a 
given  Just  have  to  describe  the  CP's  beliefs  for  out-of¬ 

equilibrium  outputs.  There  are  several  ways  to  choose  these  beliefs  so 
that  the  firms  actually  want  to  pool  at  y(b^).  The  simplest  way  to  do 
so  is  to  take  "optimistic"  (for  the  CP)  beliefs:  V  y^  ^  y(b^):  =  0, 

i.e.,  the  CP  believes  that  the  firm  which  chooses  y^  *  y(bj^)  has  a 
high  productivity;  the  corresponding  second  period  scheme  has  slope  s 


and  lump-sum  transfer  ^2  such  that  the  high  productivity  firm  makes  a 
zero  surplus  (a^  =  'I'(y(s))  -  8y(s)).  Now  it  is  clear  that  the  effi¬ 
cient  firm  does  not  want  to  reveal  its  productivity,  since  the  short-run 
gain  to  doing  so  does  not  exceed  A(b^)  and  the  long-run  loss  equals 
'5ir^(  v^) . 

We  conclude  that  there  exists  a  pooling  equilibrium  if  and  only  if 

is  satisfied,  and  that  this  equilibrium  is  unique.  The  firm, 
whatever  its  type,  produces  y(b^)  in  the  first  period;  in  the  second 
period  the  CP  chooses  a  bonus  b^  =  b(v^)  and  a  lump-sum  transfer  such 
that  the  low  productivity  firm  has  a  zero  surplus.  The  firm  then 
produces  its  optimal  static  output  given  bg. 

Separating  Equilibrium 

In  a  separating  equilibrium,  the  high  productivity  firm  produces 
y(b,  )  in  the  first  period  and  makes  a  zero  surplus  in  the  second.  If 
instead  it  decided  to  produce  y{b^)  and  pool  with  the  low  productivity 
firm,  it  would  lose  &(bj^)  in  the  first  period  and  gain  6i  (l),  as  the 
CP  is  convinced  that  he  faces  firm  0  when  he  observes  y(bj^).  Thus  a 
necessary  condition  for  the  existence  of  a  separating  equilibrium  is: 

(1*.5)  <  A(b^) 

Conversely,  it  is  easy  to  show  that  if  (4.5)  holds,  there  exists  a 
unique  separating  equilibrium  path  that  can  be  supported  by  out-of- 
equilibrium  beliefs  and  strategies  (take  optimistic  beliefs  for  output 
that  differs  from  y(b^)  and  y(b^)).  Along  this  equilibrium  path,  the 
firm,  whatever  its  type,  produces  its  static  optimum  in  the  first 
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period,  the  CP  then  has  complete  information  and  chooses  =  s  and  a^ 
so  as  to  extract  the  whole  surplus  from  the  firm.  The  firm  then 
produces  its  static  optimum  for  bonus  s- 

Semi-Separating  Equilibrium: 

In  a  semi-separating  equilibrium,  the  efficient  firm  is  indiffe¬ 
rent  between  revealing  its  productivity  by  producing  y(b^),  and  pooling 
with  the  low  cost  firm  by  producing  y(b^).  The  short-run  loss  of 
pooling  is  A(b^).  The  long-run  gain  is  6n  (v^)  where  is  the 

CP's  second  period  belief  when  he  observes  y(b^).  Note  that  must 

belong  to  (v^,l)  since  the  low-productivity  firm  produces  y(b^)  with 
probability  one.  Thus  a  necessary  condition  for  existence  of  a  semi- 
separating  equilibrium  is  that  there  exists  in  ( l)  such  that. 

(U.6)  =  A(b^) 

-F/  , 

From  Proposition  3,  (v^)  is  continuous  and  strictly  increasing 

in  v^,  thus  bj^  must  satisfy: 

(U.7)  <  A(b^)  <  6ii^(l) 

Conversely,  assume  that  (U.7)  holds.  Again  from  Proposition  3, 
there  exists  a  unique  in  (v^.l)  such  that  (U.6)  is  satisfied. 

And  therefore  there  exists  a  unique  real  number  x^  in  (O.l)  such 
that,  if  the  efficient  firm  produces  y(b^)  with  probability 
x^  and  y(b^)  with  probability  (l  -  x^),  the  CP's  posterior  belief 
when  he  observes  y(b^),  +  (l  -  Xj^)(l  -  satisfies 

(U.6).  Again  it  is  easy  to  construct  supporting  beliefs  and  strategies 
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oui,-of-equilibrium  (take  optinastic  beliefs  for  outputs  that  differ  from 

8/ 

y(b^)  and  y(b^))* 

To  conclude,  if  b^^  satisfies  (U.7),  there  exists  a  unique  semi- 
separating  equilibrium  path.  The  low  productivity  firm  produces  y(b^) 
and  the  high  productivity  firm  randomizes  between  y(b^)  and  y(b^). 
When  observing  the  non-fully  revealing  output  y(b^),  the  CP  chooses  a 
bonus  b^  =  b(v^)  (where  is  defined  by  (14.6))  and  a.2  so  as  to 

extract  the  low  productivity  firm's  surplus.  When  observing  y(b^),  he 
chooses  bonus  s  and  a.2  so  as  to  extract  the  high  productivity  firm's 
surplus . 

To  finish  the  proof  of  Proposition  k,  we  observe  that  for  a  given 
bj^,  one  of  the  three  mutually  exclusive  conditions  (I+.I4),  (ii.5)  and 
(4.7)  must  hold.  Q.E.D. 

The  continuation  equilibrium  depends  on  the  first  period  cost  of 
concealing  productivity  A(b^),  that  is  represented  in  Figure  3. 

We  shall  say  that  A(b^)  is  "well-behaved”  if  it  increases  with 
bj^  (Figure  3.1/ ;  this  means  that  when  the  bonus  increases,  the  high 
productivity  firm  must  incur  higher  and  higher  losses  in  order  to  mimic 
the  low  productivity  firm.  For  example,  one  can  easily  show  that  if  the 
cost  function  is  quadratic  in  output,  A(b^)  is  increasing  and  convex.^^ 
For  a  "well-behaved"  function,  a  higher  bonus  leads  to  more  separa¬ 
tion.  However  h(b^)  need  not  be  increasing  in  bj^ .  Its  shape  depends 
on  third  derivatives  of  the  cost  function,  on  which  we  have  little 
information.  This  makes  the  study  of  the  optimal  first-period  incentive 
scheme  analytically  complicated. 


Figure  3 

Regions  for  continuation  equilibrium 
pooling,  SS  =  semi-separating,  S  =  separating) 

F i gure  3 • 1 
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c)  Strongly  Selective  Schemes: 

Until  now  we  have  assumed  that  the  CP  did  not  want  to  choose 
incentive  schemes  such  that  the  low  productivity  firm  does  not 
operate.  Tnis  assumption  is  easily  Justified  for  the  second  period 
scheme:  the  low  productivity  firm  produces  y(b^)  in  the  first  period 

and  when  observing  y(bj^),  the  CP  has  belief  ^  Thus  if  in  a 

one-period  situation,  the  CP  wants  both  firms  to  operate,  a  fortiori  he 
does  so  in  the  second  period  when  he  observes  y(b^).  What  about  the 
first-period  scheme?  It  is  easy  to  check  that  a  scheme  such  that  no 
firm  operates  cannot  be  optimal  (it  is  disastrous  from  the  first  period 
point  of  view,  and  does  not  bring  any  information) .  Assume  now  that  the 
first-period  scheme  is  such  that  only  the  efficient  firm  produces.  It 
must  be  the  case  that; 

(4.8)  a^  +  -  ;^(y(b^))  <  0 

and 

(4.9)  +  b^y(bj^)  -  ^(SF(b^))  >  6tt^(1) 

since  the  efficient  firm  can  always  pretend  it  has  low  productivity  by 
not  producing.  This  type  of  scheme  can  be  called  a  strongly  selective 
scheme  since  in  order  to  induce  revelation  it  forces  one  type  of  firm 
not  to  operate.  The  idea  behind  proposing  a  strongly  selective  scheme 
is  to  relax  the  self-selection  constraint:  the  high  productivity  firm 
is  less  tempted  not  to  produce  than  to  produce  y(b^).  In  general  we 
cannot  exclude  on  a  priori  grounds  these  strongly  selective  schemes,  and 


4 
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in  order  to  solve  the  CP's  problem,  we  must  compare  the  best  such  scheme 
(if  these  schemes  exist)  and  the  best  scheme  obtained  under  T;he  assump¬ 
tion  that  both  firms  operate 

In  the  previous  discussion,  we  assumed  that  a  firm  that  do’.'S  not 
produce  in  the  first  period  is  able  to  produce  in  the  second.  One  can 
easily  imagine  circumstances  -  e.g.,  managers  leave  -  under  which  this 
does  not  hold.  Thus  an  alternative  assumption  is  that  only  firms  that 
produce  in  the  first  period  can  produce  in  the  second.  Under  this 
assumption,  also  a  strongly  selective  scheme  cannot  be  ruled  out;  indeed 
an  example  is  given  in  Section  5(c)  in  which  the  best  first-period 
scheme  leaves  the  inefficient  firm  ouc  ("for  ever"),  in  spite  of  the 
fact  that  in  a  static  framework  the  CP  would  want  both  firms  to  operate. 

Not  to  multiply  cases,  we  shall  assume  in  the  next  section  that 
strongly  selective  schemes  are  not  optimal. 

5.  Long-Run  Relationship  and  the  Ratchet  Effect:  The  Optimal 

Dynamic  Policy 

a)  The  CP's  first-period  problem:  key  facts: 

Although  it  is  needed  for  a  comprehensive  study  of  the  phenomenon 
under  consideration,  this  technical  section  can  be  skipped  by  the  reader 
who  is  not  interested  in  the  detailed  proof.  The  results  as  well  as 
comments  are  presented  in  next  subsection. 

The  problem  we  face  now  is  to  find  the  CP's  optimal  first-period 
decision,  i.e.,  the  optimal  b^  (remember  that  if  we  rule  out  strongly 
selective  schemes,  a^  is  a  function  of  existence  of  a 

unique  continuation  equilibrium  for  any  given  b^  makes  the  CP's  first- 
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period  maximization  problem  well-defined.  We  were  not  able  to  prove 
that  in  general  the  solution  to  this  problem  is  unique-,  however,  a 
multiplicity  of  first-period  optimal  schemes  is  not  too  severe  a 
problem,  since,  once  the  CP  has  chosen  (a^,b^),  the  continuation 
equilibrium  is  unique  (there  is  no  problem  of  coordination  between 
equilibria).  The  welfare  outcome  W  is  also  unique. 

One  important  issue,  on  which  one  would  expect  clarifications  from 
the  analysis,  concerns  the  relative  magnitude  of  the  optimal  first- 
period  bonus  in  a  dynamic  context  and  the  static  optimal  bonus.  One  is 
tempted  to  believe  a  priori  that  the  Planner  should  be  more  "generous" 
in  a  dynamic  context  in  order  to  obtain  information.  In  fact,  the 
Planner  problem  is  rather  complex  and  in  order  to  understand  it,  we  have 
to  single  out  a  certain  number  of  key  facts  which  are  gathered  in  the 
next  lemmas. 

A  preliminary  lemma  asserts  that  more  information  is  indeed 
desirable  to  the  Center. 

Lemma  3:  The  expectation  of  the  optimal  second  period  aggregate 
social  welfare  ,  for  a  given  first-period  scheme,  increases  with  the 
number  of  revealing  high  productivity  firms  x^  (or  decreases  with  the 
a  posteriori  probability  that  a  pooling  firm  is  of  high  productivity). 

The  argument  goes  as  follows;  For  the  firms  that  switch  from 
pooling  in  the  reference  situation  to  revealing,  second-period  social 
welfare  is  increased  since  the  second-period  scheme  is  now  the  full 
information  scheme  (a^  =  ^(y(s))  -  sy(s),  b^  =  s).  For  the  pooling 
firms,  the  CP  can  always  duplicate  his  previous  second-period  incomplete 
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information  scheme  in  the  reference  situation  (which  is  now  sub-optimal) 
and  leave  social  welfare  unaffected. 

We  now  consider  the  effect  of  the  first-period  choice  between 
pooling,  semi-separating  and  separating  on  first-period  aggregate 
welfare . 

First,  let  us  assume  that  firms  are  pooling.  First-period 
aggregate  welfare  is; 

W^(b^)  =  py(b^)  -  X(a(b^)  +  -  v^^(y(b^))  -  (l  -  )  ti)(y(b^ ) ) 

where  a(b^)  is  given  by  a(bj^)  +  bj^y{b^)  -  i|'(y(h^))  =  0 
We  prove: 

p 

Lemma  U :  For  b^  <  s,  W  (b^)  increases  with  b^ . 


Proof: 


dW 

db. 


[p  -  Xb^  -  -  (l 


dy 

db 


da 


As  ^'{y{b^))  <  't''(y(b^))  =  ,  it  follows  from  the  computation  of 

da/db^  that: 


dW  .  I- 

db^  "  IP 


(1  +  X)b, 


dy 


Q.E.D. 


Define : 

Wi(bi,Xi)  i  [v^  +  (l  -  v^)(i  _  x^)lW^(b^)  +  [x^(l  -  v^^)l 
Ipy(b^)  -  X(a(bj^)  +  b^y(b^))  -  i^'(y(b^))l 
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Wi(bi,x^)  is  the  aggregate  first  period  welfare  when  the  first-period 
bonus  is  b^^  and  the  proportion  of  revealing  high  productivity  firms 
is  . 

Lemma  3 :  The  first-period  aggregate  welfare  increases  with 

the  proportion  x^  of  revealing  high  productivity  firms,  for  any 
b^  <  s. 

(Note  that  Lemmas  U  and  5  hold  independently  of  the  value  of  the 
parameter  ) , 

Proof :  Assume  that  for  b^  <  s,  one  high  productivity  firm 

switches  from  pooling  to  separating.  This  move  increases  the  first- 
period  profit  of  the  firm.  Put  it  also  increases  the  CP's  "social 
concern"  utility  for  ti'.e  first  period;  social  concern  indifference 
curves  are  straight  lines  of  slope  s  (see  Figure  2)  and  separation 
moves  the  firm's  bundle  to  the  right  along  a  line  of  slope  b^,  i.e.,  on 
a  higher  social  concern  indifference  curve  when  <  s.  Q.E.D. 

The  following  corollai7  is  a  straightforward  implication  of  Lemmas 
3  and  5- 

Corollary  ’3.1:  For  any  <  s,  intertemporal  aggregate  social 
welfare  increases  with  the  number  of  revealing  high  productivity  firms. 

We  now  show  that  if  b^  >  s,  W^(s,l)  >  W^(b^,x^),  x^  *  1. 

Lemma  6 :  A  separating  equilibrium  with  bonus  s  wo\ild  be 
superior  from  the  first-period  aggregate  welfare  point  of  view  to  a 
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semi-separating  (or  pooling)  equilibrium  with  >  s . 

Proof:  Consider  the  reference  situation,  a  semi-separating  or 
pooling  equilibrium  with  b^  >  s  and  compare  it  to  a  situation  of 
separation  occurring  with  bj^  =  s  and  =  a(b^)  -t-  b^y(b^)  -  sy(b^) 
(the  new  budget  line  passes  through  (y(b^),  a(b^)  +  bj^y(b^))). 

In  this  new  situation,  consider  successively; 

-  The  first-period  aggregate  welfare  restricted  to  low  productivity 

firms:  "social  concern"  first-period  utility  remains  constant  but  the 

individual  profit  increases  when  the  firm  reacts  to  the  new  offer. 

-  The  same  argument  holds  for  the  first-period  aggregate  welfare 
restricted  to  high  productivity  firms  which  were  pooling  in  the 
initial  situations, 

-  For  high  productivity  firms  which  separate  in  the  initial  situation, 
the  total  effect  can  be  decomposed  (as  in  the  proof  of  Proposition  l) 
into  two  parts:  a  substitution  effect  along  the  initial  indifference 
curve  which  by  definition  does  not  affect  the  firm's  utility  and 
increases  the  (first-period)  social  concern  utility j  an  income  effect 
which  is  negative  from  the  firm's  point  of  view  but  positive  in  terms 
of  aggregate  welfare. 

Finally,  the  impact  on  first-period  aggregate  welfare  of  switching 
to  the  new  scheme  is  unambiguously  positive. 

Still  assuming  separation,  we  can  now  switch  to  a  scheme  with 
b^  =  s,  bringing  the  low  productivity  firm  to  its  minimum  utility,  a 
change  which  still  increases  first-period  aggregate  welfare  (such  a 
change  has  only  positive  income  effects  in  terms  of  aggregate  welfare). 


Q.E.D. 
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The  preceding  facts  hold  independently  of  any  assumption  on 
A(b^).  The  next  property  relies  on  the  fact  that  A(b^)  increases  with 


Lemma  7 :  Assume  that  A(b^)  is  an  Increasing  function  of  b^^. 
Then  if  b,  <  b(u  )  induces  a  semi-separating  or  pooling  continuation 

j.  X 

equilibrium,  it  cannot  be  a  first-period  optimal  dynamic  bonus. 

Proof:  Let  denote  the  proportion  of  revealing  high 

productivity  firms  with  the  scheme  5^  <  b{v^).  Consider 
V  =  1>(^)  ^he  static  optimal  bonus  associated 

with  V.  From  Proposition  3,  v  >  implies  b(v)  >  b(v^)  (but 
b(v)  <  s).  (Note  that  for  b(v)  the  proportion  of  high  productivity 
revealing  firms  x^  is  strictly  greater  than  x^:  this  follows  from 
the  property  of  A(b^),  Proposition  3,  and  the  consideration  of  formula 
(b.6).) 

Now  instead  of  5^,  take  b(v)  as  first-period  bonus.  The 
induced  change  can  be  decomposed  by  a  thought  experience  into  two  parts: 
-  A  change  from  b^  to  b(v)  leaving  the  number  of  high  productivity 
revealing  firms  unaffected.  For  the  group  of  low  productivity  firms 
and  revealing  high  productivity  firms,  this  change,  by  definition  of 
b(v),  is  favorable  from  the  CP's  viewpoint  in  the  first  period  and 
nothing  is  changed  in  the  second  period.  For  the  subgroup  of  non¬ 
revealing  high  productivity  firms,  the  move  is  also  socially  farorable 
in  the  first  period  (use  Lemma  1*  for  =  O)  and  indifferent  in  the 
second.  So,  the  intertemporal  aggregate  welfare  increases. 
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-  A  change  at  =  b(v)  in  the  proportion  of  revealing  high  product¬ 
ivity  firms .  This  change  is  favorable  to  the  CP  in  the  first  period 
(Lemma  5)  and  in  the  second  period  (Lemma  3)» 

The  conclusion  follows.  Q.E.D. 

b)  The  Optimal  First-Period  Bonus:  Results: 

We  have  now  enough  understanding  of  the  factors  governing  the  CP's 
intertemporal  welfare  to  be  able  to  single  out  conclusions  on  the 
optimal  first-period  dynamic  bonus. 

First,  the  fact  that  the  static  optimal  bonus  is  smaller  than  the 
social  value  of  the  good,  has  a  somewhat  weaker  dynamic  counterpart: 

Proposition  5-  If  the  continuation  equilibrium  associated  with 
the  bonus  s  is  separating,  then  b^,  the  optimal  dynamic  first-period 
bonus,  satisfies 

bj  4  s  . 

Proof;  Suppose  that  b^  >  s.  Then: 

-  either  b^  is  separating,  a  fact  ruled  out  by  Proposition  2(i), 

-  or  b^  is  not  separating  but  from  Lemma  6,  the  scheme  with  bonus  s 

would  be  better  from  the  CP's  first-period  point  of  view.  As  it  is 
also  better  for  the  second-period  aggregate  welfare  (this  is  a  variant 
of  Lemma  3),  a  contradiction  obtains.  Q.E.D. 

Let  us  now  derive  a  lower  bound  for  The  following  is  not 


very  surprising: 
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Proposition  6:  If  the  continuation  equilibrium  associated  with 
=  b(v^)  is  separating,  then: 

bj  =  b(v^) 

Prop f :  Assume  the  contrary:  5^  *  optimal. 

-  5^  cannot  be  separating:  given  that  the  infonnational  contents  of 
5^  and  b(v^)  are  the  same,  this  would  contradict  the  definition  of 
b(v^). 

-  cannot  be  a  pooling  or  semi-separating  equilibrium  with  5^  <  s. 
According  to  Lemma  5,  a  separating  equilibrium  with  b^  =  would  be 
superior  for  the  first-period  aggregate  welfare  to  the  equilibrium 
under  consideration,  it  is  also  superior  for  the  second-period  point 
of  view.  However,  this  hypothetical  separating  equilibrium  associated 
with  bj^  =  is  itself  dominated  by  the  scheme  b^  =  b(v^),  a 
contradiction . 

-  B^  cannot  be  a  pooling  or  semi-separating  equilibrium  with  B^  >  s. 

According  to  Lemma  6,  such  an  equilibrium  would  be  dominated  by  a 
hypothetical  separating  equilibrium  with  b^  =  s,  which  in  turn  is 
dominated  by  b^  =  b(v^),  a  contradiction.  Q.E.D. 

Proposition  6  looks  intuitively  reasonable.  If  the  second  best 
static  bonus  induces  in  addition  full  revelation  in  a  dynamic  context, 
it  is  the  dynamic  optimum.  What  is  surprising  is  rather  that  the  proof 
is  more  intricate  than  expected.  The  reason  is  that  pooling  in  the 
first  period  (or  semi- separation) ,  a  possibility  which  is  not  open  in 
the  second  best  static  problem,  may  be  an  interesting  option  from  the 


viewpoint  of  the  first-period  "social  concern" .  The  fact  that  it  is  not 
so  favorable  after  all  -  the  argument  on  which  the  proof  relies  -  is  a 
consequence  of  Lemmas  5  6wid  6. 

In  Proposition  7,  below,  we  will  see  that  another  intuitively 
appealir.g  assertion,  that  the  dynamic  optimal  first-period  bonus  is 
higher  than  the  second  best  optimal  static  bonus,  holds  only  in 
favorable  cases  and  for  the  same  reason  as  above  requires  a  non- 
straightfoi*ward  proof.  But  beforehand  we  will  give  a  corollary  to 
Proposition  6; 

Corollary:  3  6^  such  that  for  6  <  6^,  the  first-period  dynamic 

optimum  is  b(v^). 

Proof:  Take  6  small  enough  such  that  A(b(v,  ))  >  6  tt^(i). 

-  o  1  o 

Hence  b(v^)  is  necessarily  in  the  separating  zone  and  Proposition  6 
applies.  Note  however  that  5^  <  1;  for  6  =  l,  b(v^)  cannot  be  in  the 
separating  zone  (remember  that  if^(l)  =  A(s)).  Q.E.D. 

Restricting  now  our  attention  to  the  well-behaved  case,  i.e.,  the 
case  in  which  the  incentive  to  conceal  one's  information  decreases  with 
the  bonus,  we  have: 

Proposition  7:  In  the  well-behaved  case,  the  optimal  first-period 
bonus  exceeds  the  optimal  one-period  bonus:  b^  >  b(v^). 

Proof :  Suppose  b^  <  b(v^).  According  to  Lemma  4  and  because  of 
b(v^)  <  s,  b^  cannot  be  pooling  without  being  at  the  frontier  of  semi¬ 
separation.  But  because  of  Lemma  7.  b^^  cannot  be  semi-separating 
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(weakly  or  strictly).  Hence,  if  is  not  semi-separating  it  is 


greater  than  b(v^). 


Proposition  6,  if  b^  is  separating,  it  equals  b(v^).  The 


conclusion  follows. 


Q.E.D. 


c)  An  Example:  Let  us  assume  that  the  cost  function  has  the 
following  simple  form;  4'(y,0)  =  ky  /20  (where  k  >  O) .  The  reader  can 
easily  check  that  for  this  cost  function: 


(5.1) 


y(b,0)  = 


(5.2) 


A(b)  =  ^  1^  -  0)2 


(5.3) 


b(v)  =  s 


v0  +  (l  -  v)0 


5  -  0 


v0  +  (l  -  v)51l  + 


r-?-T 


i^(v)  =  (B  -  0) 


A(b)  is  represented  in  Figure  U.  Let  b  and  b  be  the  bonuses 
that  mark  the  upper  limits  of  the  pooling  and  semi-separating  regions. 
From  (5.2)  and  (5.U)  and  Section  (b),  b  and  b  are  defined  by: 


(5.5) 


b  =  Ub(v^) 


(5.6) 


b  =  us 


where 


Note  that  if  u  <  b(v^)/s,  then  b(v^)  belongs  to  the  separating 
region  and  thus  is  optimal.  This  case  arises  when  the  future  is  not  too 
highly  valued  and  the  productivity  differential  is  important.  On  the 
contrary ,  if  u  is  high,  b  and  b  are  very  high  relative  to  s,  and 
it  becomes  very  costly  to  separate  the  two  types  of  firms.  Moreover, 
the  second-period  social  gain  of  having  better  information  is  very  small 
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if  the  firms  are  very  alike.  Thus  the  optimal  bonus  leads  to 

pooling,  and  from  Lemma  3,  is  greater  than  s.  It  is  also  possible  to 
show  that,  for  some  values  of  the  parameters,  it  is  socially  optimal  to 
induce  semi-separation. 

Consider  now  the  possibility  of  using  strongly  selective 
schemes.  We  shall  assume  that  a  firm  that  does  not  produce  in  the  first 
period  is  able  to  produce  in  the  second.  It  is  clear  that  for  a 
strongly  selective  scheme  to  be  optimal,  the  lump-sum  a^^  must  make 
(4.9)  binding: 

(5.8)  a,  =  6i^(l)  +  iji(y(b^))  -  b^y(b^) 

Let  us  look  for  the  set  of  bonuses  such  that  the  scheme  is 
actually  strongly  selective,  i.e.,  {h.8)  is  satisfied.  From  a  simple 
computation,  this  is  the  case  if  and  only  if: 


(5.9)  '^l  ^ 

We  shall  assume  that  6  is  less  than  1.  Then  clearly  the  best 
strongly  selective  scheme  has  slope  s  and  gives  first-period  welfare 


(5.10) 


^SSS  3  (II  _  v^)[py(s)  -  ii)(y(s))(l  +  X)  -  X(6Tf^(l))l 


(1  -  V  )ps5  B  -  0 

1  [l-^( - ^)1 


2k 


1  +  X 


B 


What  is  the  best  scheme  under  the  assumption  that  both  firms 
operate?  If  P  =  /l(6/((B  -  9)/B)l  is  "high  enough",  say  above  u. 
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inducing  separation  or  semi-separation  implies  a  vor-y  high  bonus  and  a 
first-period  negative  welfare,  as  the  reader  will  check.  Thus  only  a 
bonus  that  induces  firms  to  pool  can  be  preferred  zo  strong  selection. 

p 

The  best  pooling  bonus  can  be  found  by  maximizing  W  (b): 

(5.1l)W^(b)  =  py(b)  -  ^(y(b))(l  -  X)  +  (i  -  v^)(^(y(b))  -  IKy(b))) 


(5-1'^)  (1 


v^)(l  - 


-  r-T- 


5-0  5-0 

- -)  >  (1 - T) 

5  5 


Now,  consider  a  set  of  parame^ers  such  t,h  ir, : 

1)  {5-ll4)  holds. 

2)  In  a  static  framework,  the  CP  is  indifferent  between  having 
both  firms  operate  and  having  only  the  efficient  firm  operate. 
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3)  U  is  "high  enough",  i.e.,  separation  and  semi-separation  are 
not  desirable.  Tedious  considerations  show  that  it  is  indeed 
possible  to  find  such  parameters 

We  claim  that  the  best  scheme  is  then  the  best  strongly  selective 
scheme.  If  the  CP  chooses  a  strongly  selective  scheme,  the  second 
period  welfare  is  nothing  but  the  full  information  one-period  welfare. 

If  the  CP  chooses  to  have  both  firms  pool,  the  second  period  welfare  is 

SSS  P 

the  one-period  welfare  with  prior  v^.  Since  W‘^  >  W  ,  the  strongly 

selective  scheme  dominates  the  pooling  scheme.  One  can  show  that 
Proposition  3  is  val'.d  under  the  two  alternative  assumptions,  i.e.,  that 
a  firm  does  not  produce  in  the  first  period  can  or  cannot  produce  in  the 
second ; 


Proposition  :  Sven  i the  CP  wants  both  types  of  firms  to 
operate  in  a  static  context,  his  best  strategy  in  a  dynamic  framework 
may  result  in  strong  sel>^ction,  i.e.,  elimination  of  the  less  efficient 
firm. 


Proposition  3  stiows  that  the  elimination  of  "lame  ducks",  which  in 
a  dynamic  model  induci’s  more  revelation  from  the  efficient  firm,  may  be 
socially  desirable  even  if  it  is  not  in  a  traditional  static  planning 
model . 

d)  Non-Linear  Schemes;  In  this  section  we  want  to  show  that  the 
main  ideas  developed  for  linear  schemes  carry  over  to  optimal  non-linear 
schemes.  Ag.ain  we  will  confine  ourselves  to  the  two-firm  case.  More 
general  results  for  arbitrary  distributions  are  still  to  be  found. 


-1»0- 


The  CP  chooses  two  pairs  of  incentives  schemes 
in  the  first  period  and  =  {(y2»P-2^’  (y2’^2^^  second.  The 

timing  is  the  same  as  before.  We  will  use  the  following  proposition  and 
lemmas : 

Proposition  U' :  For  a  given  ,  there  exists  a  unique 
continuation  equilibrium. 

Proof:  We  Just  sketch  the  proof  since  it  is  very  similar  to  that 
of  Proposition  U.  Let  A  be  the  cost  for  the  high  productivity  firm  to 
concealing  its  productivity: 

(5.15)  A  H  IR^  -  U,(y^)|  -  1(R^  -  iKy^)| 

Let.  V  be  the  CP's  second  period  beliefs,  ^1,,  vv)  be  the 
efficient  firm's  one-period  surplus  when  the  CP  has  beliefs  v,  and  y'^ 
be  the  output  chosen  by  the  CP  with  beliefs  v  for  firm  9.  It  is  easy 
to  show  that  increases  with  u  and  tliat  ^'-i;l(^)  ~  ’i'{y(s))  -  iKyCs)). 

Thus  if  A  >  (l)  the  efficient  firm  reveals  its  productivity; 

N  L/ 

if  A  <  chooses  and  il'  <  A  < 

the  continuation  equilibrium  is  semi-separating.  Q.E.D. 

Let  us  now  consider  the  CP's  first  period  decision  problem.  We 
shall  assume  that  the  CP  chooses  to  have  both  firms  operate.  Much  of 
the  discussion  of  Sections  l‘(b)  and  5(c)  on  strongly  selective  schemes 
would  carry  over  to  non-linear  schemes.  We  first  prove  the  following 


two  lemmas 
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Lemma  8: 

Prop f :  It  suffices  to  notice  that  the  continuation  equilibrium 
depends  only  on  A  and  not  directly  on  (as  long  as  both  firms 

operate).  Q.E.D. 

Lemma  9:  =  y(s) 

Proof :  Let  P'  =  (y^,R^)  and  consider  the  indifference  curve 
through  P' •  If  <p'(y^)  *  s,  then  the  CP  can  do  better  by  proposing 
instead  of  P'  the  point  P  where  the  tangent  to  the  indifference 
curve  through  P*  is  of  slope  s,  (unless  A  induces  firms  to  pool,  in 
which  case  P'  is  irrelevant).  From  P'  to  P,  A  and  remain  the 

same  and  tr  ‘  increases.  Q.E.D. 

'Jsing  fjemmas  and  9,  we  can  represent  the  CP's  first  period 
decision  problem  with  the  following  diagram: 


We  sec  tha^  'che  CP's  choice  amoun'cs  to  that  of  an  output  for  firm 

0  :  V  and  of  a  cost  of  concealment  for  firm  0  A.  Note  that  the 
■Li 

optimal  choice  of  A  given  will  either  result  in  a  pooling 

equilibrium  or  belong  to  the  interval  (resulting 

in  a  separating  or  semi-separating  equilibrium):  Among  the  A  that 
induce  a  reveaiing  equilibriujn ,  the  CP  always  prefers  the  lowest: 

(l)*  We  can  now  prove  Proposition  6'; 
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Proposltion  6' :  For  the  optimal  first  period  non-linear  scheme, 
the  marginal  cost  of  the  low  productivity  firm  (4»'(yj^))  exceeds  that 
obtained  for  the  one-period  optimal  non-linear  scheme  ( <|»' (y(  v^) ) . 


Proof: 

(s)  First  assume  that  the  CP  induces  a  revealing  equilibrium 
(A  =  6-4(1)).  Then  =  y(  v-,  )  since  the  CP's  optimization  problem  is 
identical  with  the  one-period  problem  with  beliefs  v^,  up  to  the 
additional  separation  cost  (l  -  v^){XA}. 

(P)  Next  assume  that  the  CP  decides  to  have  both  firms  pool  (by 
choosing  e.g.,  A  =  o).  We  claim  that  ip'(y^)  >  s.  Figure  6  is  drawn 
under  the  assumption  that  'l'’(yj^)  <  s: 


Figure  6 

Pooling  equilibrium  <|''(y^)  <  s 


y 


In  Figure  6,  both  firms  pool  at  S.  Clearly  a  point  such  as  S' 
would  be  preferred  by  the  CP  and  the  high  productivity  firm  and  leaves 
the  low  productivity  firm  indif ferent 

(SS)  Lastly  assume  that  the  equilibrium  is  semi-separating.  Let 
us  show  that  if  <(<'(y^)  <  (y{ )  Ihe  CP  can  induce  a  semi-separating 

equilibrium  with  a  higher  welfare  level.  Indeed  assume  that  he  chooses 
y{u^)  instead  of  y^  and  that  he  maintains  the  same  A.  From 
Proposition  U'  ,  the  proportion  of  high  productivity  firms  that 

reveal  their  information  remains  the  same.  Now  perform  a  statistical 
decomposition  into  two  populations:  in  the  first  population,  put  all 
revealing  high  productivity  firms  and  a  fraction  of  low  productivity 
firms  such  that  the  proportion  low/high  productivity  remains  the  same  as 
the  initial  one;  put  the  other  firms  in  the  second  population.  For  the 
first  population,  the  optimum  output  for  the  low  productivity  firm  is 
y(v^).  For  the  second  population,  welfare  is  higher  for  y(v^)  than 
for  y^  from  (P).  Thus  y^  >  ) .  Q.E.D. 

6.  Alternative  Incentive  Schemes 

In  this  paper  we  considered  only  optimal  linear  and  non-linear 
schemes.  There  has  recently  been  a  lot  of  discussion  about  intermediate 
schemes,  in  particular  the  so-called  old  and  new  Soviet  incentive 
schemes  (see,  e.g..  Fan  [l9T5l,  Bonin  [19761,  Ekern  [l9T9l  and  Holmstrom 
[19791 ). 

We  shall  represent  the  old  and  new  Soviet  incentive  schemes  as 
piecewise  linear  schemes  with  respectively  one  and  two  kinks,  such  that 


1 


1 


In  the  old  scheme,  the  CP  chooses  {yQ,o,S,Y} 

and  rewards  the  firm:  [o  +  6(y  -  yQ)l  if  y  ^ 

y  ^  yn-  If’  ^^’6  f’S'f  scheme,  the  CP  chooses  {y 
u  o 

yg  >  0,  y^  >  0,  Y  >  6  >  6  >  0,  Qq  +  6{y^  -  y^)  =  oi^ 

firm  (oq  -  Y(yQ  -  y)|  if  y  <  y^,  (op  +  6(y  -  y^)] 
and  (a^  +  B(y  -  y^^)]  if  y  >  y^^.  This,  of  course 
interpretation  of  the  new  Soviet  incentive  scheme, 
formalization,  the  CP  chooses  the  line  with  slope 


with  yQ>0,  Y>6>0 
and  (a  -  Y{yQ  ~  y)I 
.yi.Ol.B.Y.fi}  vith 
and  rewards  the 

if  y^  >  y  >  yg, 

is  not  the  usual 
In  the  usual 
,  the  slopes  B  and 


Y,  and  two  bounds  and  y^  (y^  <  y^);  then  the  firm  chooses  a 

target  ^2  within  (yQ,yj^)  and  is  rewarded  according  to  the  associated 
old  scheme  {y2^®(y2) where  ^  is  the  revenue  corresponding 
to  y^  on  the  line  with  slope  6  (see  Figure  8).  Similarly,  it  is 
possible  to  interpret  the  old  scheme  as  the  announcement  by  the  firm  of 
a  target  which  is  restricted  by  only  one  constraint.  The  announcement 
interpretation  is  not  relevant  to  our  situation,  in  which  the  social 
price  of  output  is  given.  However,  if  p  is  not  given,  the  announce¬ 
ment  effect  may  matter.  For  example,  it  may  allow  the  CP  to  develop  or 
look  for  a  substitute  supply  of  the  good  earlier  than  would  otherwise  be 
permitted  . 

We  Just  considered  one  kind  of  uncertainty  faced  by  the  CP,  namely 
that  arising  from  incomplete  information.  Assume  now  that  the  firm's 
output  is  random;  and  that  the  social  price  of  output  p  is  not 
given.  One  can,  for  example,  imagine  that  the  firm  produces  an  inter¬ 
mediate  good  for  which  no  close  substitute  exists.  It  is  in  general 
important  that  the  firm's  production  does  not  fall  under  some  threshold 
in  order  not  to  create  a  severe  shortage  of  the  good.  On  the  other 
hand,  a  very  high  production  is  a  relatively  minor  improvement  over 
normal  production  since  downstream  firms  then  form  a  bottleneck.  It  is 
thus  worth  encouraging  managers  and  workers  to  work  very  hard  in 
(industry-specific)  bad  states  of  nature,  and  not  rewarding  them  too 
generously  for  overproduction.  Non-linear  schemes  allow  the  CP  to 
induce  the  firm  to  internalize  to  some  extent  the  non-linearity  of  the 


social  value  of  output. 
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As  we  said,  the  latter  point  as  well  as  the  announcement  effect 
are  absent  in  our  model  since  we  assioraed  that  the  social  price  of  output 
is  given  (for  example,  determined  by  the  world  price).  Hence,  in  our 
context,  the  old  and  new  Soviet  incentive  schemes  -  which  are 
intermediate  between  linear  and  fully  non-linear  schemes  -  are  of 
limited  interest.  Therefore,  we  will  just  mention  without  proofs  the 
following  results  for  the  two-firm  case. 

1)  Static  model: 

(a)  The  best  old  scheme  is  also  the  best  new  scheme  (this  result 
of  course  does  not  generalize  to  more  than  two  firms). 

(b)  The  best  old  scheme  dominates  the  best  linear  scheme.  For 
the  best  old  scheme,  the  shadow  price  of  output  s  exceeds 
the  marginal  cost  of  the  low  productivity  firm,  which  in  turn 
exceeds  the  marginal  cost  of  the  high  productivity  firm  (this 
last  comparison  differs  from  the  usual  conclusion  for  optimal 
fully  non-linear  schemes). 

2)  Dynamic  model:  There  is  a  ratchet  effect,  i.e.,  firms  may 
want  to  hide  their  information  by  not  choosing  their  static  optimizing 
output  (this  result  of  course  holds  for  most  classes  of  incentive 
schemes) . 

An  interesting  line  of  research  would  be  the  analysis  of  the 
relative  sensitivity  of  different  classes  of  schemes  to  the  ratchet 
effect  (for  example:  is  the  payoff  to  pretending  to  have  a  low 
productivity  smaller  in  the  new  Soviet  incentive  scheme  than  for  linear 
schemes? ) . 


T 


Commitment 


We  have  analyzed  the  ratchet  effect  in  the  context  of  a  game.  An 
other  approach  to  this  problem  assumes  that  the  CP  is  able  to  commit 
himself  to  an  intertemporal  sequence  of  incentive  schemes;  in  this  case 
he  announces  the  current  scheme  as  well  as  the  revision  procedure,  and 
the  firm  solves  its  dynamic  programming  problem  given  the  CP's  plans. 
This  is  the  set-up  considered  by  Weitzman  [1980I  and  Holmstrom  [l9T9l  . 
Holmstrom  show  that,  in  spite  of  the  ratchet  effect,  a  revision 
procedure  is  preferable  to  a  fixed  scheme. 

Note  that  the  CP  always  prefers  committing  himself.  Indeed  he  can 
always  reach  our  (unique)  equilibrium  social  welfare  by  duplicating  his 
optimal  strategy( ies)  .  The  firm's  dynamic  programming  response  is  then 
its  optimal  strategy  in  the  non-commitment  game. 

On  the  other  hand,  casual  empiricism  suggests  that  central 
planners  generally  do  not  commit  themselves  to  revising  incentive 
schemes  in  a  given  way.  We  must  then  ask  why  they  do  so.  Although  this 
question  is  out  of  the  scope  of  this  paper,  we  would  like  to  mention 
some  elements  of  answer.  One  possibility,  of  course,  is  that  the  CP  is 
not  aware  of  the  benefits  of  commitment.  But  there  may  be  deeper 
reasons.  First,  commitment  may  not  be  credible.  The  CP  is  free  to 
design  incentive  schemes,  but  also  to  change  them  over  time  at  his 
discretion.  Also  the  planner  may  be  replaced  and  the  new  planner  may 
not  feel  obliged  to  abide  by  the  schemes  designed  by  his  predecessor 
(remember  that  the  time  period  in  planning  is  not  short).  Second,  the 
costs  of  designing  intertemporal  plans  may  be  very  high,  in  particular 
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when  there  is  a  high  number  of  economic  units.  Third,  it  is  possible 
that  the  CP  can  over  time  obtain  new  information  about  the  firm  (other 
than  its  output)  or  about  the  future  social  price  of  output  (for 
example,  the  CP  obtains  inside  information  about  effort,  observes 
indicators  other  than  output,  etc.).  This  fact  by  itself  does  not 
contradict  the  general  proposition  that  in  the  absence  of  "transaction 
costs"  the  CP  is  better  off  committing  himself  if  he  can.  But  it 
greatly  increases  commitment  costs:  The  CP  must  then  give  a  detailed 

account  of  the  future  incentive  schemes  as  a  function  of  his 
IB/ 

information  .dii' 

8 .  Alternative  Interpretations  of  the  Model 

The  idea  that  if  the  relationship  between  two  economic  agents  is 
not  one-shot,  these  have  incentives  to  hide  their  information,  is  a  very 
general  one.  For  example,  the  ratchet  effect  is  ubiquitous  in  all 
relationships  within  a  firm  or  institution  (a  zealous  and  efficient 
typist  quickly  becomes  overloaded  with  work;  a  complacent  employee  or 
manager  gets  to  do  the  ingrate  work,  etc.).  More  generally,  it  pervades 
any  principal-agent  relationship.  An  example  similar  to  the  one 
presented  here  is  that  of  an  efficient  regulated  firm  which  sees  the 
price  for  its  output  revised  downwards  over  time.  Another  example  of 
regulation  concerns  pollution  controls.  Yao  [19821  considers  the 
dynamics  of  regulation  of  automobile  emissions  control  and  its  effect  on 
the  strategy  of  car  manufacturers,  also  from  a  non-commitment  point  of 
view.  The  analysis  of  the  dynamics  of  the  principal-agent  relationship 
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could  also  be  relevant  to  optimal  taxation:  the  government  over  time 
learns  the  ability  of  tax-payers  and  in  principle  might  want  to  use  this 
information  to  design  Individual  tax-schemes.  Why  it  in  general  does 
not  do  so  is  an  interesting  question,  on  which  this  kind  of  analysis  may 
throw  some  light.  Lastly  we  note  that  a  kind  of  ratchet  effect  also 
arises  in  bargaining  situations  (actually  this  analogy  was  the  starting 
point  of  our  analysis).  Consider  two  firms  or  governments  bargaining 
repeatedly  over  different  contracts Each  partner  would  like  to 
establish  a  reputation  for  being  a  tough  bargainer  in  order  not  to  face 
tougher  and  tougher  behavior  from  the  other  bargainer. 
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Appendix  -  Proof  of  Proposition  3 
P 

Proposition  3:  b(v)  and  i  (v)  are  increasing  in 

Proof : 

Let  us  prove  that  the  optimal  bonus  grows  with  the  proportion  of 
low  productivity  firms. 

Let  V  be  greater  than  v  and  suppose  that  b(v')  <  b(v). 
Decompose  the  "population”  {V  ,  1  -  v'  '  of  low  anvl  high  productivity 
firms  into  two  subpopulations. 

The  first  one  has  all  high  productivity  firms  and  v"  = 

(v/(l  -  v))(l  -  v' )  iow  productivity  firms. 

The  second  has  {v'  -  v”)  '..-w  pro.iuc  t  iv  ity  firms  (with 

v'  -  v"  =  (l/(l  -  v))(u'  -  v)  > 

For  the  first  population,  wt  .  f.  na;-  t‘a'  sam“  rimative  composition 
as  the  economy  (v,  1  -  v),  b(v)  ,s  t-at’-r  from  tne  w-ofare  viewpoint 
than  b( v' ) . 

For  the  second  population  oi'  iow  product  i  v ;  ty  f.rms,  we, fare 
increases  with  b,  when  b  <  s,  hence  b\  v)  .s  a ,  tetter  tt.a:. 

b(v').  These  remarks  contradict  the  fact  that  ■  t,.‘. 

F 

To  show  that  t?  is  increasing  in  v,  it  is  .gt,  to  remarx  that 
from  the  firm*s  first  order  conditions  dTT‘/db  ~  y{i  :  -  \■^v)  ^  . 
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1/  The  formalization  of  the  firm  as  a  unique  aggregate  agent  is 

standard  in  the  Literaturt .  It  seems  to  be  an  unavoidable  first 
step  in  the  analysis.  Note  however  that  facts  suggest  that  the 
efficiency  of  incentive  schemes  do  crucially  depend  upon  the  way 
incentives  are  channelled  within  the  firm  through  "delegation" 
systems . 

An  alternative  interpretation  is  that  there  are  a  large  number  of 
firms  and  that  f(9)  describes  the  proportion  of  firms  of 
parameter  0.  A;  thou, --.h  on  many  points  the  following  analysis 
remains  valid  witr.  such  ati  interpretation,  the  comments  made  rely 
on  the  one-firm  assum.pt  son  . 

3/  Non-linear  scheme's  with  a  more  trad.  Itionai  Interpretation  for  a 
.puota  are  discussed  :i,  .'ectioti  ii. 

4/  Let  us  briefly  cv.-nsis-.'r  the  case  when  the  distribution  of  firms  is 

continuous,  the  continuous  density  over  [9  .  ,0  1  being 

min  max 

f(6).  We  can  demonstrate  that  Propositon  2  is  robust  to  the 
distribution  of  firms.  There  is  a  slight  comp’l  ication  added:  One 
can  in  general  not  assum<*  that  the  IP  wants  t.he  lowest  product¬ 
ivity  firm.  ( c  .  )  to  operate.  Let  0  (a.b)  (>  0  .  )  be  the 
m,  H.  o  min 

"cut-off  productivity",  i.e-.,  the  productivity  under  which  the 

;'inn  does  not  want  to  operate:  Max  {a  +  by  -  iliy.B  (a,b))}  =  0. 

y  o 

The  bf-havior  is  l-u'cribed  by  the  following  p'rogram: 

a 

(3.1')  Max  '  ,  V  {py(h,0)  -  A(a  by(b,9))  -  il(y  ( b  ,  6  )  ,  6  ) }  f  (  9  )d0 

{a,b}  ■' 

It  IS  easily  siiown  that  tb.<’  equivalent  of  (3.3b)  is: 

e 

s  _  i.  =  - - - - 

Vu,b)  g  (b,9)f(0)d9 

o 

which  demonstrates  that  s  >  b.  Rather  than  deriving 
observe  that  the  above  proof  carries  over  to  a  continuous 
di.itribution.  ;’'tarting  from  a  bonus  scheme  (a,b)  assumed  to  be 
optimal  and  wheiv  the  "cut-off  firm"  is  0  (a.b),  we  change  it 
into  a  bon\is  sc!i'»me  (a',s)  with  the  same  "cut-off  firm".  The 
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above  ar^'um-'rio  correi:;  omi  in,'.'  to  tht'  ca;;-.-  wiicr''  o  =  B  applies 

her.-'  I'or  every  ti  >  h  when  th..-  'irm  is  ot'  byjje  b , 

a^>'r''gaT:.e  wel:'ari>  increas  e-,  with  tin-  !i"W  sch'Mn'..  iiy  sui.'jr.inp  ovi-r 

6  >  6  (a,b),  we  conolu.i--  tnat  b  ^  s. 
o 

In  fact,  it  is  easy  to  se"  that  th.^  s^t  of  ax  ions  for  the 
princ  ipal-ajcent  relations/. ip  consiiier'-i  in  t.’ii.s  latt'T  article, 
applies  here  (with  slipht  noi  if  ic.at  io.ns )  .  Mer:-''  t.he  res-ilts  i.'i 

This  subsection  can  '.i.'  v/ew-.a  as  p.articula.-  cisos  of  tn.’  more 

c'.ner.ai  prcpos  ;  t.jns.  ef  in- e;  ner  i  e-.h-'i  . 

ih-j  t'act  tiiat  i  a  '.ctrion  aisc../u:.t  factor  not  cruc.oi.  for 

tiie  theory  alt/.o.vh  ;t  s;::/:  iifi-s  the  reaLio/iiny.  how-vr,  w.,-/ 
/..fl  jrent  discount  factor.;,  tr;.,  /neanir.,'  of  W,  tn.-  su:r.  o!' 
diisioanted  pr-.of.t  i.n  i  .‘.liiy  concrr.. -'i"  V'-.iare  (ce-e  i; 

•  t.’.a’'  ».^'UOLiS  • 

ror  notat,  ioril  j;"i:  w.-  sua.*  li;:--  L:,^■  :\ir 

i i’*s  -ifiJ  # 

'iote  however,  that  t;.'  .  trate.v  wnic.n  i 'eio;,  te..i  does  not 

strictly  dominate,  fr-.c:!  ■'..'•m's  :  oir.t  of  vii'V,  other  nixed 

strategies,  ''.'his  inc.^nven  -i.t  feat. if-?  of  th.,'  /oixei  stratep,y 
oquilibriiLTi  wo  hav-'  i.s  ho'w.'ver  matniy  u;  artifact  of  the  extreme 
stylization  of  our  mo-i".  -inu  -^ouid  ii.-:.i;pear  under  tiie  more 


real  istic 

assumpt Ion 

-i 

continuum  o:' 

si'nli.'  values  for  6. 

•Hence  for 

quadra!.  c 

unci  i or. y,  /.(: 

/ 

i;i  "very  weil- 

behaved" . 

I'h  i  3  SUgge 

b  w.  3  X, 

;iat  the  ini'..; 

*nc: 

e  of  hig'tier  order 

der ivat  ive 

s  must  't-.-  s 

tror:.;* 

to  i  n'.luci'  a 

i'a  i 

lure.'  of  the  ass'omption 

Choose  11 

-  u  an.i  re 

jji-.'ml 

r  tliat  A  =  n 

i(n 

-  u)^C)  and  let 

V.  and  ((5  -  0)/u)  oc  "nougii  so  that  (i.la)  is  satisfied 

(and  6  is  sraall.-r  t'/ian  .  .le  just  ii  ive  to  satist'y  condition 

(2)  by  adjusting  A  and  v,  <  v.  [t  is  not  very  difficult  to  see 

that  this  is  pws'.sible  ;5ince  (.}  V  ; ,  the  .'P  -wants  lioth  i'irms  to 
operate  for  A  smail  enourh  (ii)  V  A,  the  fP  wants  only  one  firm 
to  operate  for  sraal  i  ■•/loug.h. 

One  can  also  show  that  for  the  best  scheme  in  the  class  of  pooling 
schemes  if'Cy^)  <  s. 

Note  that  the  benefit  of  pre-production  .announci'ment  exists  for 
the  old  scheme  and  the  line.ar  scheme  .as  well  as  t/ie  new  scheme. 

The  point  is  that  in  the  lin.-ar  and  old  schemes,  the  incentive  to 
announce  the  true  output  is  -w.-aK  (in  that  there  are  other 
optimizing  announcements )  whereas  truth  is  the  only  optimizing 
announcement  in  the  new  scheme. 


Note  that  these  future  incentive  schemes  have  to  be  incentive 
compatible  from  the  point  of  view  of  the  planner  if  the  firm 
cannot  observe  all  the  CP's  new  information. 

Formally  bargaining  sequentially  over  different  contracts  does  not 
differ  much  from  bargaining  sequentially  over  a  given  contract. 
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